Overcoming language priors via shuffling language bias for Robust Visual Question Answering

نویسندگان

چکیده

Recent research has revealed the notorious language prior problem in visual question answering (VQA) tasks based on visual-textual interaction, which indicates that well-developed VQA models rely learning shortcuts from questions without fully considering evidence.To tackle this problem, most existing methods focus decreasing incentive to learn knowledge by adding a question-only branch and becoming complacent mechanically improving accuracy. However, these over-correct positive biases useful for generalization, leading degradation of performance v2 dataset when cumulating their into other architecture. In paper, we propose robust shuffling bias (SLB) approach explicitly balance prediction distribution, hopefully alleviating increasing training opportunities models.Experiment results demonstrate our method is cumulative with data augmentation large-scale pre-training architectures achieves competitive both in-domain benchmark out-of-distribution VQA-CP v2.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering

A number of studies have found that today’s Visual Question Answering (VQA) models are heavily driven by superficial correlations in the training data and lack sufficient image grounding. To encourage development of models geared towards the latter, we propose a new setting for VQA where for every question type, train and test sets have different prior distributions of answers. Specifically, we...

متن کامل

Using Generalized Language Model for Question Matching

Question and answering service is one of the popular services in the World Wide Web. The main goal of these services is to finding the best answer for user's input question as quick as possible. In order to achieve this aim, most of these use new techniques foe question matching. . We have a lot of question and answering services in Persian web, so it seems that developing a question matching m...

متن کامل

Language Independent Passage Retrieval for Question Answering

Passage Retrieval (PR) is typically used as the first step in current Question Answering (QA) systems. Most methods are based on the vector space model allowing the finding of relevant passages for general user needs, but failing on selecting pertinent passages for specific user questions. This paper describes a simple PR method specially suited for the QA task. This method considers the struct...

متن کامل

Natural Language Annotations for Question Answering

This paper presents strategies and lessons learned from the use of natural language annotations to facilitate question answering in the START information access system.

متن کامل

Natural Language Processing for Question Answering (NLP4QA)

The recent developments in Question Answering have kept with open-domain questions and collections, sometimes argued as being more difficult than narrow domain-focused questions and corpora. The biomedical field is indeed a specialized domain; however, its scope is fairly broad, so that considering a biomedical QA task is not necessarily such a simplification over open-domain QA as represented ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3304415